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Abstract. This is just a short proof of Kruskal's theorem regarding uniqueness of expressions 
for tensors, phrased in geometric language. 



Let A,B,C be complex vector spaces of dimensions a, b, c. Consider a tensor T E B f^iC 
and say we have an expression 

(1) T = Ui <S Vi Wi + • ■ ■ + Ur Vr <Si Wr 

where uj € A,Vj € B, Wj G C, and we want to know if the expression is unique up to re-ordering 
the factors (call this essentially unique). The rank of T is by definition the smallest such r such 
that T admits an expression of the form ([T]). For the tensor product of two vector spaces, an 
expression as a sum of r elements is never unique unless r = 1. Thus an obvious necessary 
condition for uniqueness is that we cannot be reduced to a two factor situation. For example, 
an expression of the form 

T = ai (8) 61 (g) ci + ai (g) 62 (X) C2 + as (8) 63 (8) C3 + . . . + Or (8) 6r (8 Cr 

where each of the sets {cj}, {bj}, {c^} are linearly independent is not unique because of the first 
two terms. In other words if we consider for ([T]) the sets Sa = {[ui]} C P^, Sb = {[vi]} C FB, 
Sc = {[wi]} C PC each of the sets must consist of r distinct points. 

We recall the classical fact: 

Proposition 1. Let n > 2. Let T € Ai^ • • • (8 An have rank r. Say T E A'^<^ • • • (8 A'^, where 
A'- C Aj, with at least one inclusion proper. Then any expression T = '^21=1 • • • 8" with 
some Uj A'g has p > r. 

Proof. Choose complements A'^ so At = A'^QA^. Assume p = r and write Uj = n*' + n*" 
with Uj' E A't, n*" E A". Then T = X]f=i "^l'^ ■ ■ ■ ® u^' so all the other terms must cancel. 
Assume p = r, and say, e.g., some u^ ' / 0. Then X;5=i u]" (n^ O • • • (8 uf) = 0, but all the 
terms {u^' ® • • • (8 u"') must be linearly independent in A'2® • • • 18 A'^ otherwise r would not be 
minimal, thus all the u}j" must all be zero, a contradiction. □ 

Definition 2. Let S = {xi, ...,Xp} C PH^ be a set of points. We say the points of S are in 
2-general linear position if no two points coincide, they are in 3-general linear position if no 
three lie on a line and more generally they are in r- general linear position if no r — 1 of them lie 
in a P*""^. We let the Kruskal rank of S, kg, be the maximum number r such that the points of 
S are in r-general linear position. 

If one chooses a basis for W so that the points of S can be written as columns of a matrix 
(well defined up to rescaling columns), then kg will be the maximum number r such that all 
subsets of r column vectors of the corresponding matrix are linearly independent. (This was 
Kruskal's original definition.) 
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Theorem 3 (Kruskal,[I]). Let T ^ A® B ® C . Say T admits an expression T = Yl\=i Ui®Vi®Wi. 
Let Sa = {[u^]},Sb = {[n-^},Sc = {[wi]}. If 

(2) r<]^{ks^+ks^+ksa)-l 

then T has rank r and its expression as a rank r tensor is essentially unique. 

Above, we saw a necessary condition for uniqueness is that ks^ , ksg , ks^ > 2 and it is an easy 
exercise to show that if ([2]) holds, then fc^^, kgg, k^^ > 2. (Hint: a priori kg^ < r.) 

Note that if a = b = c and T : {A(^ B)* C and similar permutations are surjective, then 
it is very easy to see such an expression is unique when r = a. Kruskal's Theorem extends the 
uniqueness toa<r<|a— 1. 

The key to the proof of Kruskal's theorem is the following lemma: 

Lemma 4 (Permutation lemma). Let W he a complex vector space and let S = {pi, 
S = he sets of points in ¥'W and assume no two points of S coincide (i.e., that 

kg >2) and that (S) = W . If all hyperplanes H C PVF that have the property that they contain 
at least dim {H) + 1 points of S also have the property that #(<S fl H) > fl H), then S = S. 

If one chooses a basis for W = C" and writes the two sets of points as matrices M, M, then the 
hypothesis can be rephrased (in fact this was the original phrasing) as to say that for all x € C" 
such that the number of nonzero elements of the vector *Mx is less than r — rank(M) + 1 also 
has the property that the number of nonzero elements of the vector *Mx is at most the number 
of nonzero elements of the vector *Mx. To see the correspondence, the vector x should be 
thought of as point of W* giving an equation of H, zero elements of the vector *Mx correspond 
to columns that pair with x to be zero, i.e., that satisfy an equation of H, i.e., points that are 
contained in H. 

Note a slight discrepancy with the original formulation: we have assumed (S) = W so 
rank(M) = n. Our hypothesis is slightly different, but it is all that is needed by Proposi- 
tion [TJ Had we not assumed this, there would be trivial cases to eliminate at each step of our 
proof. 

Proof. First note that if one replaces "hyperplane" by "point" in the hypotheses of the lemma, 
then it follows immediately as the points of S are distinct. The proof will proceed by induction 
going from hyperplanes to points. Assume {k + l)-planes M that have the property that they 
contain at least k -\- 2 points of S also have the property that ^(S O M) > i^{S D M) and we 
will show the same holds for /c-planes. Fix a /j-plane L containing > /c + 1 points of S, and 
let {Ma} denote the set of k + 1 planes containing L and at least /U + 1 elements of iS. We have 

#{S n L) + ^ #(5 n iMa\L)) = R 

a 

#{S n L) + ^ #(5 n (M„\L)) < R 

a 

the first line because every point of iS not in L is in exactly one Ma and the second because 
every point of S not in L is in at most one Ma- Rewrite these as 

{#Ma - 1)#(5 n L) - ^ #(5 n Ma) = -R 

a 

{#Ma - 1)#(5 n L) - ^ #(5 n Ma) > -R 

a 

But by our induction hypothesis #{S H Ala) > #{<S ri Ma) so putting the two lines together, 
we obtain the result for L. □ 
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Proof of theorem. Given decompositions = Yl^j=i % ® '"j ®Wj,(j) = Yl'^j=i % ® '^j ® '"^i length 
r we want to show they are essentially the same. (Note that if there were a decomposition ^ of 
length e.g., r — 1, wc could construct from it a decomposition of length r by replacing ui(^vi® wi 
by ^ni ^ Si ® iwi + ^ui (^vi(^wi, so uniqueness of the length r decomposition impHcs the rank 
is r.) We first show Sa = <Sa,<Sb = <Sb,<Sc = -Sc- By symmetry it is sufficient to prove the last 
statement. By the permutation lemma it is sufficient to show that if C PC is a hyperplane 
such that Ci H) > c — 1 then #(5c n i/) > n H) because we already know kg^, > 2. 

Recall the classical fact about matrices (due to Sylvester): if M G ^4(8)5 and U C A, V C B, 
then 

rank(M) > rank(M|[;±x^*) + rank(M|^*^yx) — Tank{M\^±y^y±). 
Let Ah := {uj \ [wj] ^ H), Bh {vj \ [wj] ^ H) 
#(4 ^ i?) > rank(r(ii-^)) 

> rank(r(i7^)U^x^B0 + rank(T(i7^)U.^B^x) - rank(r(i7^)U^x^s,^) 

> min(A;A, #{Sc t H)) + min(A;B, #{Sc t H)) - #{Sc t H) 

where the last line follows by the definition of Kruskal rank. Finally we need to show that 
#((Sc ^ H) < mm(kA, ks)- But this follows because 

r - #(5c (tH) = #iSc cH)>c-l>kc-l>2r-kA-kB + l 

i.e., kA + ks — #{Sc <t H) > r + 1, which can only hold if #((Sc ^ H) < min(/s^, ks)- 

Now that we have Sa = Sa etc.. , say we have two expressions 

T = Ui ® Vi ® Wi + ■ ■ ■ + Ur ® Vr ® Wr 

T = Ui (g) ^^.(i) (g) Wt-(i) H h Itr (g ^o-(r) ^ '■'VCr) 

for some a, r G Gr- First observe that if a = r then we are reduced to the two factor case 
which is easy, i.e., if T G ^®-B of rank r has expressions T = ai (g) 6i + • • • + (g 6r and 

r = ai (g) 6cr(i) + h Or <g) b(T(r)) then it is easy to see that a = Id. 

So assume a ^ t, then there exists a smallest jo G {1, ...jt} such that (7(jo) =: sq ^ Iq := 
r(jo). We claim there exist subsets S,T C {1, ■■■,r} with the properties 

• So e S,to e T, 

• SnT = 0, 

• #{S) <r-kss + l, #(r) <r-ksa + l and 

• (-yj I j G 5"=) =: Hs C -B, (wj I j G T'') =: Ht C C are hyperplanes. 
Here = {!,..., r}\5. 

To prove the claim take a hyperplane Ht C C containing Wsq but not containing wt^, and 
let T'" be the set of indices of the Wj contained in Ht, so in particular #(T'^) > k^^ — 1 
insuring the cardinality bound for T. Now consider the linear space {vt \ t (z T) C B. Since 
#(T) < r — + 1 < — 1 (the last inequality because ksj^ < r), adding any vector of Sb 
to (vt I t G T) would increase its dimension, in particular, VgQ ^ {vt \ t T). Thus there exists a 
hyperplane Hs C B containing {vt \ t G T) and not containing VgQ- Let S be the set of indices 
of the Vj contained in Hg- Then 5, T have the desired properties. 

Now by construction Tl^^x^^^x = 0, which implies there is a nontrivial linear relation among 
the Uj for the j appearing in S fl T, but this number is at most min(r — ksg + 1, r — ks^ + 1) 
which is less than fc^^. □ 

Remark 5. There were several inequalities used in the proof that were far from sharp. In fact, 
Kruskal proves versions of his theorem with weaker hypotheses designed to be more efficient 
regarding the use of the inequalities. 
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Remark 6. The proof above is essentially Kruskal's. The reduction from a 16 page proof to the 
2 page proof above is mostly due to writing statements invariantly rather than in coordinates. 

More generally, Kruskal shows that for d factors, if Yli=i > 2r + d — 1 then uniqueness 
holds. 
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